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Field of the Invention 

This invention relates generally to the field of data transmission and more 
particularly to a method and apparatus for maintaining a desired amount of data in a 
5 buffer. 

Background of the Invention 

The goal of any communication system is to transmit real-time signals from one 
location to another. Currently, there are two types of networks that can be used to convey 

10 real-time media signals; circuit switched networks and packet switched networks. A 
circuit switched network provides a dedicated point to point communication path 
between two or more users. A media signal is transmitted over the dedicated circuit, 
received by the other side and played out to a user. A packet switched network, in 
contrast, divides a message to be sent into data packets, sent individually over the 

15 network, and reassembled at a final location before being delivered to a user. To ensure 
proper re-assembly of the blocks of data at the receiving end, various control data, such 
as sequence and verification information, may be appended to each packet in the form of 
a packet header, or otherwise associated with the packet. At the receiving end, the 
packets are then reassembled and transmitted to an end user in a format compatible with 

20 the user's equipment. 

Packet switched networks are now competing with conventional circuit switched 
networks to provide interactive communications services such as telephony and multi- 
media conferencing via the Internet. This technology is presently known as internet 
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telephony, IP telephony or, where voice is involved, Voice over IP (VoIP). In VoIP 
networks, audio signals are digitized into frames and transmitted as packets over an IP 
network. The transmitter may send these packets at a constant transmission rate. An 
appropriately configured receiver will receive the packets, extract the frames of digital 
5 data and convert the digital data into analog output using a digital to analog (D/A) 

converter. Although the packets are transmitted at a constant data rate, packets will not 
necessarily arrive at their destination at a constant rate. Rather, because of variable 
delays through the network, and different transmit paths taken by packets, there is a 
packet delay variation (pdv) at the receiver. Because a digital audio data (for example a 
10 digitized voice conversation) must be played out at a constant output rate in order to 
reconstruct a high quality audio signal, the delay variation between packets is 
undesirable. 

A known solution for this problem is to implement a jitter buffer in the receiver. 
A jitter buffer is a buffer that stores frames as they are received from the network, and 

15 outputs them at a constant output rate, thus absorbing the packet delay variation. As long 
as the average rate of reception of the packets is equal to the constant output rate, the 
jitter buffer allows the packets to be output at the constant output rate even though they 
are not necessarily received at a constant rate. The jitter buffer by its nature introduces 
delay into the communication path; that is there is a delay while the packet travels 

20 through the jitter buffer until it is processed, or 'played out' at the receiver. The delay 
between receipt of the packet and the play out of the packet is referred to hereinafter as 
the play out time offset of the receiver. 
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In the context of interactive real-time communications such as internet telephony, 
delay is particularly problematic, since participants to such communications expect the 
network connection to simulate immediate, in-person interaction, without delay. Provided 
with more than a maximum tolerable end-to-end delay (a matter of design choice), 
5 conversation participants may be faced with the unsettling experience of having to wait 
some time after one person speaks before the other person hears what was spoken. 
Consequently, in most telecommunications networks carrying real-time media signals, 
there is a need to reduce or minimize the total end to end transmission delay. One 
method of doing so is to control the size of the jitter buffer. 

10 The jitter buffer should be large enough to store a sufficient number of packets to 

insure that the slowest data in an audio sequence has sufficient time to arrive at the 
receiver before playback. Too small a jitter buffer can give rise to packet loss, which 
produces audible pops and clicks and other distortion. However, a large jitter buffer 
increases the playout time offset, resulting in echo and talker overlap in the received 

15 signal. 

Several different methods have been used to adapt the jitter buffer size during 
operation to ensure optimum capture of packets while minimizing delay. For example, in 
one system, the variation of packet level in the jitter buffer is measured over a long 
period of time, and the jitter buffer size is incrementally adapted to match the calculated 
20 jitter. Such a system works well in transmission networks that provide consistent packet 
performance, such as Asynchronous Transfer Mode (ATM) networks, but are not as 
useful in systems with highly variable packet inter-arrival times. 
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A second approach for adapting the jitter buffer size is to count the number of 
packet that arrive late, and create a ratio of these packets to the number of packets 
successfully processed. The ratio is used to adjust the jitter buffer to target a 
predetermined allowable packet ratio. This approach works best with networks having 
5 highly variable packet inter-arrival times, such as IP networks, but is not as efficient in 
system having consistent packet arrival times, such as ATM networks. 

It would be desirable to identify a method and system for adaptively selecting 
jitter buffer size that is useful in a variety of transmission systems. 

10 Summary of the invention 

The present invention provides a method for dynamically adjusting jitter buffer 
size according to buffer fill dynamics. According to one embodiment, an upper threshold 
and lower threshold for the jitter buffer are identified, wherein the lower buffer threshold 
identifies a minimum desirable number of packets in the jitter buffer, and the upper 

15 buffer threshold identifies a maximum desirable number of packets in the jitter buffer. At 
any time, if the jitter buffer is only storing a number of packets that is below the lower 
threshold when it is time to play out a received packet sequence, then the time offset is 
considered too small. An adaptive controller causes dummy packets to be inserted in the 
output data stream to permit the jitter buffer to fill to a desirable level. If the buffer 

20 always has packets above the upper threshold, then the play out time offset is considered 
to be too large. The adaptive controller reduces the number of packets maintained in the 
buffer to below the threshold by discarding a number of packets at the head of the buffer 
or speeding up playout, thereby reducing the playout time offset. The ideal buffer size is 



4 



Attorney Docket No. 120-332 
Nortel Ref. No. 16267ROUS01U 
Ouellette et al. 

somewhere between the two thresholds, and the adaptive algorithm ensures that the 

number of packets is within threshold bounds. 

According to one aspect of the invention, and apparatus includes a controller, a 

buffer, coupled to the controller, the buffer having a target fullness associated therewith 
5 and providing the controller with current fullness indications; and wherein the controller 

includes means for collecting the current fullness indications and means for periodically 

updating the target fullness of the buffer in response to the current fullness indications. 

According to another aspect of the invention, a method for adapting a size of a 

buffer including the steps of monitoring a fullness of the buffer to track fullness 
10 characteristics of the buffer and periodically adjusting the size of the buffer in response to 

the fullness characteristics of the buffer. 

According to a further aspect of the invention, a method for adapting a size of a 

buffer capable of receiving packets includes the steps of identifying a target fullness of 

the buffer, the target fullness indicating a desired number of packets to be stored in the 
15 buffer, assigning an upper threshold and lower threshold to the buffer, the upper 

threshold indicating a maximum number of packets to be stored in the buffer, and the 

lower threshold indicating a minimum number of packets to be stored in the buffer, 

monitoring the buffer for a time interval to obtain fullness characteristics of the buffer; 

and adjusting the target fullness of the buffer in response to the fullness characteristics of 
20 the buffer . 
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Brief Description of the Figures 

Figure 1 is a block diagram of a device, such as a voice or video decoder, in 
which the adaptive jitter buffer controller of the present invention may be implemented; 

Figure 2 is a flow diagram illustrating exemplary steps that may be undertaken to 
5 monitor and record dynamic jitter buffer characteristics; 

Figure 3 is a flow diagram illustrating exemplary steps that may be taken by the 
adaptive jitter buffer controller of the present invention to maintain jitter buffer size 
between desired thresholds. 



10 Detailed Description 

Referring now to Figure 1, a block diagram of a device 10 that may be included in 
a receiver in a packet based network is shown. The embodiment described below will 
refer to packets that are received in a Voice Over Internet Protocol (VOIP) network 
device, however it should be appreciated that the below concepts may be expanded for 

15 use in any system whereby units of data are received at a device and it is desirable to 
monitor and update the characteristics of the buffer that stores the received units of data. 
Accordingly, the use of the terms 'packet' and VOIP network are exemplary only, and 
not a limitation of the present invention. 

In addition, it should be noted that the attached description and drawings refer to 

20 several functional blocks or flow diagram steps that each perform a described function. 
The delineation of functionality between the blocks or steps has been described herein for 
exemplary purposes only, and it should be understood that the same functionality may be 
obtained through the use of differently arranged components and processes. 
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Accordingly, the present invention is not limited to the discrete components or flow 
diagram steps, but rather encompasses all equivalent method or means for performing a 
similar function. 

In a VOIP network, the device 10 may be a voice decoder or any device that 
5 collects digital packets and converts them for audio reproduction. The device 10 is 
shown to include a jitter buffer 12, for temporary storage of packets received from the 
network. According to one aspect of the invention, associated with the jitter buffer are a 
number of variables identifying characteristics of the jitter buffer. The variables include 
Max _JB_Size 2, Upper_Threshold 4, Target_JB_Size 5, LowerJThreshold 6 and 

10 Min_JB_Size 8. The Max_JB Size is the physical maximum size of the jitter buffer, and 
the Min_JB_Size 8 is the physical minimum size of the jitter buffer. The Target_JB_Size 
5 identifies a desirable number of packets which should be stored in the jitter buffer at 
any time to provide desirable system performance in terms of packet delay and loss 
characteristics. The Target_JB_Size 5 may be initially set by the user. The Upper 

15 threshold is a selected upper bound size of the jitter buffer. When the size of the jitter 
buffer exceeds this threshold, the jitter buffer is approaching maximum capacity, thus 
approaching a maximum playback offset and packet loss scenario. Similarly, the 
LowerJThreshold is a selected lower bound size of the jitter buffer. When the number of 
packets in the jitter buffer falls below this threshold, there may not be enough packets in 

20 the jitter buffer to support a steady stream of audio output, thus introducing undesirable 
pops, crackles or delays in the voice transmission. 

One aspect of the present invention is the adaptive process which is used to ensure 
that the number of packets in the jitter buffer remains within the Upper and Lower 
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thresholds. In one embodiment, the adaptive process is controlled by the jitter buffer 
controller 16. As shown in Figure 1, the jitter buffer controller 16 controls access to the 
jitter buffer 12 by maintaining a current read point 24 and write pointer 20 to the buffer. 
The jitter buffer controller 16 also monitors the jitter buffer to identify a current buffer 
5 fullness 22, identifying a number of packets in the jitter buffer at a given time. The 
current buffer fullness 22 can be easily tracked by the controller monitoring a count of 
packets read into the buffer less a count of packets output from the buffer, although other 
methods can also be used. It can be updated on each read to or write from the jitter 
buffer. The jitter buffer controller monitors the current buffer fullness during a periodic 

10 interval. The periodic interval is selected to be short enough to capture highly variable 
packet arrival time. At the end of the periodic interval, the jitter buffer controller 16 
analyzes the fullness characteristics of the buffer during the time interval to determine 
whether the number of packets maintained in the buffer provides a desirable playback 
offset of the received signal to be maintained. 

15 After analyzing the fullness characteristics of the buffer for the interval, the jitter 

buffer controller forwards a value Delta on signal line 17 to packet transmit logic 25. 
The packet transmit logic 25 controls the playback of the packets from the jitter buffer. 
In one embodiment, the packet transmit logic increases the packet offset delay by 
inserting dummy packets into the transmission path. As the dummy packets are inserted 

20 into the receiver data stream, the jitter buffer fills to a desirable level. In another 

embodiment, the packet transmit logic selectively drops packets from the jitter buffer, in 
order to decrease the packet offset delay. As the network traffic and jitter buffer 
characteristics change during operation, the jitter buffer controller continually updates the 
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Delta value forwarded to the packet transmit logic. With such an arrangement, a system 
is provided that permits the jitter buffer size to be maintained within a desirable range to 
satisfy system performance requirements. The system of the present invention is a closed 
loop system which iteratively uses dynamic jitter buffer characteristics to adapt the jitter 
5 buffer size. Thus the present invention is easier to implement than several prior art 
systems which record round-trip packet transmission times, arrival rates, etc. 

Referring now to Figures 2 and 3, block diagrams illustrating exemplary steps that 
may be taken to adapt jitter buffer size using dynamic buffer characteristics will now be 
described. Figure 2 illustrates steps that are taken during a monitoring interval, while 

10 Figure 3 illustrates steps that may be taken to modify packet handling at the device in 
order to restore the jitter buffer to a desired threshold. 

In Figure 2, at step 40, several variables that are used by the jitter buffer controller 
are initialized. These variables are used during the monitoring process, and include 
operational limits defining the Target_Jitter_Buffer_Size, Upper_Threshold and 

15 LowerJThreshold values for the jitter buffer. In one embodiment, as mentioned above, 
the Target_Jitter_Buffer_Size indicates the desirable number of packets that should be 
maintained in the jitter buffer to ensure that desirable delay and packet loss performance 
is achieved. The Upper_Threshold is determined by applying an 'above_target' 
multiplier to the Target_Jitter_Buffer_Size to identify a maximum number of packets in 

20 excess of the target that are permissible. Similarly, the LowerJThreshold is determined 
by applying a 'below target' multiplier to the Target_Buffer_Size to identify a minimum 
number of packets below the target that are permissible. These values may be user 
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entered, or calculated and updated by a computer program in response to ongoing 
network traffic analysis. 

The jitter buffer controller also uses tracking variables to track the characteristics 
of the jitter buffer during the monitoring interval. The tracking variables include min_fill 
5 and maxjfill. The min_fill variable identifies the minimum number of packets that are in 
the jitter buffer during the monitoring interval, and the max_fill variable identifies the 
maximum number of packets that were stored in the jitter buffer during the monitoring 
interval. At initialization, the min_full variable is set equal to the Max_JB_Size value, 
and the max_fill variable is set equal to the Min_JB_Size value. 

10 After the variables are initialized at step 40, the monitoring process is started at 

step 42. As mentioned above, the current buffer fullness information is collected for the 
jitter buffer over the periodic monitoring interval. The current fullness data may be 
collected at a set time interval, or it may be collected each time a packet is written to the 
buffer, each time a packet is read from the buffer, or some combination thereof. The 

15 duration of the monitoring interval is a matter of design choice, depending upon the 

resources which are available in the design for storing and processing the current fullness 
data. 

At step 44, the current buffer fullness status is retrieved by the jitter buffer 
controller. At steps 46 and 48, it is determined, respectively, whether the current buffer 
20 fullness is less than the current min_fill value, and whether the current buffer fullness 

exceeds the current max_fill value. If it is determined that the current fullness is less than 
the current min_fill value, the min_full value is overwritten with the current fullness 
value at step 47. Alternatively, if it is determined that the current fullness is greater than 

10 
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the current max_fill value, the max_full value is overwritten with the current fullness 
value at step 49. 

At step 50, it is determined whether the monitoring period has ended. If not, at 
the process returns to step 44 to retrieve more current fullness data. When the monitoring 
5 interval ends, the process proceeds to step 52, where the collected data is analyzed by 
adaptive control logic in the jitter buffer control logic. 

Step 52 is shown in more detail in Figure 3. At step 60, it is first determined 
whether the physical capabilities of the jitter buffer have been reached. This is 
determined by comparing the min_full result against the Min_JB_Size value, and the 
10 max_full result against the Max_JB_Size value. If the values are greater than or equal to 
the physical maximum and minimum of the buffer, an Error alert is forwarded to the user. 

If the physical capabilities of the buffer have not been reached, then at step 64 a 
Delta value is calculated. The Delta value indicates a desired change in the playout time 
offset for the receiver. The Delta value can be calculated in a variety of ways using the 
15 known variables and retrieved min_full and max_fill values, and the present invention is 
not limited to any particular method of calculating the Delta. In short the Delta value 
calculation should use the variables to determine what changes need to be made at the 
receiver to result in jitter buffer fullness levels that approach the targetjitter_buffer 
fullness value. 

20 For example, if it is determined that the min_fill result obtained during the 

monitoring process greater than or equal to the Lower_Threshold variable value, the 
Delta values can be calculated using below Equation I: 
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Equation I: 

Deltamin = (Lower_Threshold - minJfill)/2; 
Alternatively, if it is determined that the min_fill result is less than or equal to the 
Lower_Threshold value, the Delta value can also be calculated using below Equation II: 

5 

Equation II: 

Deltamin = (LowerJThreshold - min_fill) 

The resulting Delta value of Equation II is larger than the Delta value resulting 
10 from Equation I, because there is no division by two. Thus, the action taken by the 

receiver to restore the jitter buffer to the desired capacity will occur more drastically with 
the Delta returned by Equation II, and thus playback offset delays will not be minimized 
at the expense of packet loss. 

The Delta value is then forwarded to the Packet Transmit logic. In an 
15 embodiment where the Delta value is used to indicate an inadequate number of packets in 
the jitter buffer, the Delta value is used to identify a number of dummy packets that are to 
be played out by the receiver. There are many method of inserting dummy packets into 
the data stream, and the present invention is not limited to any particular implementation. 
For example, the Delta value may map directly to the number of dummy packets, or 
20 number of dummy packets may be a multiple of the Delta value. The dummy packets 
may be immediately output when the Delta value is received, or interspersed with jitter 
buffer packets. 
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In an alternate embodiment, the Delta value may be calculated to identify a 
number of packets in the jitter buffer in excess of the UpperJThreshold. This calculation 
is shown in Equation HI: 

Equation III: 

5 

Deltamax = max_fill - UpperJThreshold. 

When using the Delta calculation of Equation m, the Delta number may be used 
to identify a number of packets to drop to bring the jitter buffer size to the desired level, 
10 and reduce the playback offset delay. 

When the Delta has been calculated, at step 66 the target_buffer__size value is 
updated to reflect the desired buffer size given current traffic conditions. In one 
embodiment, the Target_JB_Size is calculated according to Equation IV below: 

15 Equation IV: 

Target_JB_Size = max (Target_JB_Size + Delta, Lower_Threshold) 
The function 'max' assigns the Target_JB_Size with whichever value field is 
greater. Thus, in Equation IV, the Target_JB_Size can be modified, but will not be set to 
any value that is below the Lower_Threshold value. 
20 An alternative method of calculating an updated TargetJB JSize is provided 

below in Equation V: 
Equation V: 

13 
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Target_JB_Size = max ((max_fill - min_fill)/2 - (min_fill - Lower_Threshold), 
LowerJThreshold) 

In Equation V, the Target_JB_Size is updated to be equal to the average jitter 
5 buffer size during the monitoring period, less the difference between the min_fill and the 
Lower_threshold. Thus, the Target_JB_Size is reduced to a level that minimizes packet 
offset delay, while ensuring that no packet loss is incurred. 

Accordingly a system and method has been shown and described which controls jitter 
buffer fullness, and concomitantly system performance, through the use of periodic 
10 buffer characterizations. Adjustments to the packet delay offset are achieved through 
analysis of the dynamic buffer characteristics, and through insertion or deletion of 
packets in response to the characteristics. Periodic monitoring and adjustment helps to 
ensure that the buffer retains its desired fullness in the face of changing network traffic 
characteristics. The present invention thus uses local dynamic buffer characteristics to 
15 regulate jitter buffer fullness and hence system quality and performance with minimal 
complexity. 

The above description and Figures have included various process steps and 
components that are illustrative of operations that are performed by the present invention. 
However, although certain components and steps have been described, it is understood 
20 that the descriptions are representative only, other functional delineations or additional 
steps and components can be added by one of skill in the art, and thus the present 
invention should not be limited to the specific embodiments disclosed. In addition, the 
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present invention is not limited to a method for controlling buffer fullness in a VOIP 
network, but may be used to design any buffer capable of storing units of data. 

Figure 2 and 3 are flowchart illustrations of methods, apparatus (systems) and 
computer program products according to an embodiment of the invention. It will be 
5 understood that each block of the flowchart illustrations, and combinations of blocks in 
the flowchart illustrations, can be implemented by computer program instructions. These 
computer program instructions may be loaded onto a computer or other programmable 
data processing apparatus to produce a machine, such that the instructions which execute 
on the computer or other programmable data processing apparatus create means for 

10 implementing the functions specified in the flowchart block or blocks. These computer 
program instructions may also be stored in a computer-readable memory that can direct a 
computer or other programmable data processing apparatus to function in a particular 
manner, such that the instructions stored in the computer-readable memory produce an 
article of manufacture including instruction means which implement the function 

15 specified in the flowchart block or blocks. 

Those skilled in the art should readily appreciate that programs defining the 
functions of the present invention can be delivered to a computer in many forms; 
including, but not limited to: (a) information permanently stored on non-writable storage 
media (e.g. read only memory devices within a computer such as ROM or CD-ROM 

20 disks readable by a computer I/O attachment); (b) information alterably stored on 
writable storage media (e.g. floppy disks and hard drives); or (c) information conveyed to 
a computer through communication media for example using baseband signaling or 

15 
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broadband signaling techniques, including carrier wave signaling techniques, such as 
over computer or telephone networks via a modem. 

While the invention is described through the above exemplary embodiments, it 
will be understood by those of ordinary skill in the art that modification to and variation 
of the illustrated embodiments may be made without departing from the inventive 
concepts herein disclosed. Moreover, while the preferred embodiments are described in 
connection with various illustrative program command structures, one skilled in the art 
will recognize that the system may be embodied using a variety of specific command 
structures. Accordingly, the invention should not be viewed as limited except by the 
scope and spirit of the appended claims. 
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